A Novel Optimized Language-Independent Text Summarization Technique
Abstract
A substantial amount of textual data is available electronically in several languages. This volume of text leads to information redundancy. It is essential to remove this redundancy and decrease the time needed to read these data. Therefore, we need a computerized text summarization technique to extract relevant information from a group of documents with correlated subjects. This paper proposes a language-independent extractive summarization technique. The proposed technique presents a clustering-based optimization approach: the clustering determines the main subjects of the text, while the optimization minimizes redundancy and maximizes significance. Experiments are devised and evaluated using the BillSum dataset for the English language, MLSUM for German and Russian, and Mawdoo3 for the Arabic language. The experiments are evaluated using ROUGE metrics. The results showed the effectiveness of the proposed technique compared with other language-dependent techniques. Our technique achieved better ROUGE metrics on all utilized datasets. It accomplished an F-measure of 41.9% for Rouge-1, 18.7% for Rouge-2, 39.4% for Rouge-3, and 16.8% for Rouge-4 on average across the three objectives. The system also exhibited improvements of 26.6%, 35.5%, 34.65%, and 31.54% with respect to a recent model in terms of these metrics. The model's performance is higher than that of other models, especially on ROUGE-2, which measures bigram matching.
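For readers unfamiliar with the evaluation, the following is a minimal sketch of how a ROUGE-N precision, recall, and F-measure can be computed from clipped n-gram overlap. It is not the authors' evaluation code: the function names `ngrams` and `rouge_n` are hypothetical, and the lowercase whitespace tokenization is a simplifying assumption that would need to be replaced by a suitable tokenizer for the languages in the datasets above.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate, reference, n=2):
    """Return (precision, recall, f_measure) based on clipped n-gram overlap.

    Assumption: tokens are obtained by lowercasing and splitting on
    whitespace; no stemming or stop-word handling is applied.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall) if overlap else 0.0
    )
    return precision, recall, f_measure
```

As a usage example, `rouge_n("the cat sat on the mat", "the cat lay on the mat", n=2)` finds three shared bigrams out of five on each side, so precision, recall, and F-measure are all 0.6. With `n=1` the same pair shares five of six unigrams.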
Similar resources
Language-independent Techniques for Automated Text Summarization
Text summarization is the process of distilling the most important information from source/sources to produce an abridged version for a particular user/users and task/tasks. Automatically generated summaries can significantly reduce the information overload on intelligence analysts in their daily work. Moreover, automated text summarization can be utilized for automated classification and filte...
A language independent approach to multilingual text summarization
This paper describes an efficient algorithm for language-independent, generic extractive summarization of a single document. The algorithm is based on structural and statistical (rather than semantic) factors. Through evaluations performed on single-document summarization for English, Hindi, Gujarati and Urdu documents, we show that the method performs equally well regardless of the language. T...
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Language Independent Summarization Approaches
In this chapter, the authors introduce monolingual and multilingual summarization and present the problem of dependence of language and linguistic knowledge of the process. Then they describe the most influential works and techniques in the field of automatic multilingual and language-independent summarization. This section is presented as a solution to solve the problem already explained. The ...
Language Independent Extractive Summarization
We demonstrate TextRank, a system for unsupervised extractive summarization that relies on the application of iterative graph-based ranking algorithms to graphs encoding the cohesive structure of a text. An important characteristic of the system is that it does not rely on any language-specific knowledge resources or any manually constructed training data, and thus it is highly portable to new ...
Journal
Journal title: Computers, Materials & Continua
Year: 2022
ISSN: 1546-2218, 1546-2226
DOI: https://doi.org/10.32604/cmc.2022.031485